IN THE CLAIMS: 



Please amend the claims as follows: 

1. (Currently Amended) A method for extracting commands and acoustic data in a same 
utterance, comprising the steps of: 

decoding at least one word in acoustic data representing an acoustic signal that 
comprises a human utterance and determining acoustic word boundaries within the acoustic 
data; 

extracting at least one explicit command in a decoded utterance using a current 
recognition vocabulary ; 

extracting, in the decoded utterance using the current recognition vocabulary, at least 
one implicit command related to the at least one explicit command; 

changing to a different recognition vocabulary responsive to the at least one implicit 
command; and 

identifying acoustic data segments in the utterance that are data parts of at least the at 
least one explicit command based on the acoustic word boundaries and using at least the 
different recognition vocabulary . 

2. (Original) The meUiod as recited in claim 1, wherein the step of determining acoustic 
word boundaries includes finding segment boundaries by iteratively comparing the same 
utterance to a plurality of vocabularies. 
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3. (Currently Amended) The method as recited in claim 1, further comprising the step of 
executing the at least one explicit command from the decoded utterance. 

4. (Currently Amended) The method as recited in claim 3, further comprising at least 
one of storing the acoustic data segments and using the acoustic data segments in executing the 
at least one explicit command. 



5. (Original) The method as recited in claim 3, further comprising the step of 
submitting at least one non-command voice data segment for recognition using the recognizer 
vocabulary. 

6. (Original) The method as recited in claim 1, further comprising the step of changing a 
recognizer vocabulary. 

7. (Original) The method as recited in claim 1, further comprising the step of submitting 
the acoustic data segments for recognition when computing resources are available. 

8. (Currently Amended) The method as recited in claim 1, wherein tire step of extracting 
at least one explicit command from the utterance includes employing one or more grammars to 
distinguish the command. 

9. (Original) The method as recited in claim 8, wherein the grammars include a form for 
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extracting information for an order or verbal contract. 

10. (Original) The method as recited in claim 8, wherein the grammars include a form 
for reminding a user to perform a task. 

11. (Original) The method as recited in claim 8, wherein the grammars include a form 
for extracting maximum meaningful length segments under interruption or silence conditions. 

12. (Original) The method as recited in claim 8, wherein the step of using grammars 
includes the step of associating at least one grammar label widi the corresponding segment of 
acoustic data that has been decoded into a command. 

13. (Original) The metiiod as recited in claim 12, wherein the label includes a numerical 
value associated with each command. 

14. (Currently Amended) The method as recited in claim 1, further comprising the step 
of executing the at least explicit command in the utterance using undecoded acoustic data from 
witliin the same utterance. 

15. (Original) A program storage device readable by machine, tangibly embodying a 
program of instructions executable by the machine to perform method steps for processing 
commands and voice data in a same utterance as recited in claim 1. 



16. (Currently Amended) A method for recognizing at least one command and at least 
one segment of acoustic voice data in a same utterance comprising the steps of: 

decoding at least one word in voice data representing the acoustic signal that comprises 
a human utterance and determining the acoustic word boundaries within tiie voice data; 
extracting at least one command from the utterance; and 

associating segments in tire voice data based on the acoustic word boundaries with 
labels ; and 

executing the at least one command utilizing undecoded information in the acoustic 
voice data . 

17. (Original) The metiiod as recited in claim 16, wherein the step of extracting includes 
employing an application, which identifies commands in the utterance in accordance with the 
labels. 

18. (Cancelled) 

19. (Original) The method as recited in claim 16, wherein the step of extracting includes 
the step of storing at least one non-command voice data segment. 

20. (Original) The method as recited in claim 16, wherein the step of extracting includes 
calling a vocabulary for recognizing numbers and recognizing the numbers in the utterance. 
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21. (Previously Presented) The method as recited in claim 16, wherein the step of 
extracting includes extracting acoustic data based on acoustic word boundaries and saving the 
acoustic data for acoustically rendering the acoustic data. 

22. (Previously Presented) The method as recited in claim 16, wherein the step of 
extracting includes extracting acoustic data based on acoustic word boundaries and decoding 
the acoustic data for storage. 

23. (Original) The method as recited in claim 16, wherein the step of associating 
includes the step of changing a recognizer vocabulary and submitting at least one non-command 
voice data segment for recognition. 

24. (Original) The method as recited in claim 16, further comprising the step of 
buffering the utterance to be processed and maintaining the utterance in memory during 
processing of the utterance. 

25. (Previously Presented) The method as recited in claim 16, wherein the step of 
associating segments of the word boundaries of the commands with a label includes employing 
grammars to associate a unique label with each command segment in the utterance. 

26. (Original) The method as recited in claim 25, wherein the label includes a numerical 



value. 



27. (Original) The method as recited in claim 25, wherein the grammars include a form 
for extracting information for an order or verbal contract. 

28. (Original) The method as recited in claim 25, wherein the grammars include a form 
for reminding a user to perform a task. 

29. (Original) The method as recited in claim 25, wherein the grammars include a form 
for extracting maximum meaningful length segments under interruption or silence conditions. 

30. (Original) The method as recited in claim 16, wherein the step of determining the 
acoustic word boundaries includes finding segment boundaries by iteratively comparing the 
same utterance to a plurality of vocabularies. 

31. (Original) A program storage device readable by machine, tangibly embodying a 
program of instructions executable by the machine to perform method steps for recognizing 
commands and voice data in a same utterance as recited in claim 16. 

32. (Original) A system for recognizing commands and voice data in a same utterance 
comprising: 

an acoustic input, which receives utterances; 
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a data buffer configured to store audio data representing the utterances; 

a speech recognition engine configured to match portions of the utterances to acoustic 
models and language models to recognize words and word boundaries in the utterance and 
labels commands in the utterance; 

at least one program that executes label-identified commands and processes remaining 
portions of the utterance including processing audio data parts separately from tbe commands 
using a different vocabulary, the vocabulary being selected in accordance with at least one 
command in the utterance. 

33. (Original) The system as recited in claim 32, wherein the at least one program 
includes a function which searches the utterance for labels output from the speech recognition 
engine to execute a command associated with the label. 

34. (Original) The system as recited in claim 32, wherein, in accordance with each label, 
an audio segment is identified and processed. 

35. (Original) The system as recited in claim 32, wherein the speech recognition engine 
utilizes grammars with labels, which the system uses for assigning labels to decoded 
commands. 

36. (Original) The system as recited in claim 35, wherein the grammars are represented 
in Bachus-Naur Form (BNF). 
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